Varieties of Regularities in Weighted Sequences
نویسندگان
چکیده
A weighted sequence is a string in which a set of characters may appear at each position with respective probabilities of occurrence. A common task is to identify repetitive motifs in weighted sequences, with presence probability not less than a given threshold. We consider the problems of finding varieties of regularities in a weighted sequence. Based on the algorithms for computing all the repeats of every length by using an iterative partitioning technique, we also tackle the all-covers problem and all-seeds problem. Both problems can be solved in O(n) time.
منابع مشابه
Searching for Regularities in Weighted Sequences
In this paper we describe algorithms for finding regularities in weighted sequences. A weighted sequence is a sequence of symbols drawn from an alphabet Σ that have a prespecified probability of occurrence. We show that known algorithms for finding repeats in solid sequences may fail to do so for weighted sequences. In particular, we show that Crochemore’s algorithm for finding repetitions cann...
متن کاملComputation of Repetitions and Regularities of Biologically Weighted Sequences
Biological weighted sequences are used extensively in molecular biology as profiles for protein families, in the representation of binding sites and often for the representation of sequences produced by a shotgun sequencing strategy. In this paper, we address three fundamental problems in the area of biologically weighted sequences: (i) computation of repetitions, (ii) pattern matching, and (ii...
متن کاملI-45: Advance MRI Sequences in Pelvic Endometriosis
Background: To assess MRI in diagnosing endometriotic lesions, emphasizing T2*weighted imaging efficacy. Materials and Methods: This prospective study of 48 females (22-38 years, average 29.6) clinically suspected of endometriosis from September 2009 to April 2012. MRI was performed with a 1.5 T imager (Siemens) with a body array coil. T1, T2 and T2* weighted (2D-FLASH) sequences were obtained ...
متن کاملWheat and barley seed system in Syria: How diverse are wheat and barley varieties and landraces from farmer’s fields?
"> The present study described the diversity of wheat and barley varieties andlandraces available in farmer’s fields in Syria using different indicators. Analysisof spatial and temporal diversity and coefficient of parentage along withmeasurements of agronomic and morphological traits were employed to explain thediversity of wheat and barley varieties or landraces grown by farmers in Syria.Farm...
متن کاملThe Weighted Suffix Tree: An Efficient Data Structure for Handling Molecular Weighted Sequences and its Applications
In this paper we introduce the Weighted Suffix Tree, an efficient data structure for computing string regularities in weighted sequences of molecular data. Molecular Weighted Sequences can model important biological processes such as the DNA Assembly Process or the DNA-Protein Binding Process. Thus pattern matching or identification of repeated patterns, in biological weighted sequences is a ve...
متن کامل